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Abstract 

One of the earliest and best-known application of the probabilistic method is 
the proof of existence of a 2 log n- Ramsey graph, i.e., a graph with n nodes that 
contains no clique or independent set of size 21ogn. The explicit construction 
of such a graph is a major open problem. We show that a reasonable hardness 
assumption implies that in polynomial time one can construct a list containing 
polylog(n) graphs such that most of them are 2 log n- Ramsey. 



1 Introduction 

A A;- Ramsey graph is a graph G that has no clique of size k and no independent set of 
size k. It is known that for all sufficiently large n, there exists a 2 log n- Ramsey graph 
with n vertices. The proof is nonconstructive, but of course such a graph can be built 
in exponential time by exhaustive search. A major line of research is dedicated to 
constructing a fc-Ramsey graph with n vertices with k as small as possible and in 
time that is bounded by a small function in n, for example in polynomial time, or 
in quasi-polynomial time, DTIME[2 polylog(n )]. Till recently, the best polynomial-time 
construction of a fc-Ramsey graph with n vertices has been the one by Frankl and 
Wilson |FW81] . for k = 2 0<K ^ Xogn \ Using deep results from additive combinatorics 
and the theory of randomness extractors and dispersers, Barak, Rao, Shaltiel and 
Wigderson |BRSW06] improved this to k = 2^ osn ^° m . Notice that this is still far off 
from k = 2 log n. 

As usual when dealing with very difficult problems, it is natural to consider easier 
versions. In this case, one would like to see if it is possible to efficiently construct a 
small list of n- vertices graphs with the guarantee that one of them is 2 log n- Ramsey. 
The following positive results hold. 
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Theorem 1.1 There exists a quasipolynomial-time algorithm that on input l n returns 
a list with 2 log n graphs with n vertices, and most of them are 2 log n-Ramsey. In fact, 
since in quasipolynomial time one can check whether a graph is 2\ogn-Ramsey, the 
algorithm can be modified to return one graph that is 2\ogn-Ramsey. 

Theorem 1.2 Under a reasonable hardness assumption H , there exists a constant 
c and a polynomial-time algorithm that on input 1" returns a list with log c n graphs 
with n vertices, and most of them are 2\ogn-Ramsey. 

The proofs of these two results use basic off-the-shelf derandomization techniques. 
The proof (one of them) of Theorem 11.11 notices that the probabilistic argument that 
shows the existence of 2 log n-Ramsey graphs only needs a distribution on the set 
of n- vertices graphs that is 2 log 2 n- wise independent. There exist such distributions 
whose support have the following properties: (a) the size is 

2 0(iog 3 „) and ( b ) it 

can 

be indexed by strings of size 0(log 3 n). Therefore if we make an exhaustive search 
among these indeces, we obtain the result. 

Theorem 11.21 uses a pseudo-random generator g that can fool NP-predicates. The 
assumption H, which states that there exists a function in E that, for some e > 0, 
requires circuits with SAT gates of size 2 en , implies the existence of such pseudo- 
random generators. Then going back to the previous proof, it can be observed that 
the property that an index corresponds to a graph that is not 2 log n- Ramsey is an 
NP predicate. Since most indeces correspond to graphs that are 2 log n-Ramsey, 
it follows that for most seeds s, g(s) is also 2 log n-Ramsey. Therefore, it suffices 
to make an exhaustive search among all possible seeds. Since a seed has length 
O (log | index |) = O (log log 3 n), the result follows. 

Theorem 11.21 can be strengthened to produce a list of concise representations of 
graphs. A string t is a concise representation of a graph G = (V, E) with V = 
{1, . . . , n} if there is an algorithm A running in time poly(logn) such that for every 
u G V,v G V, A(t,u,v) = 1 if (u,v) G E and A(t,u,v) = if (u,v) £ E. With 
basically the same proof as that of Theorem 11.21 one can show the following result. 

Theorem 1.3 Under a reasonable hardness assumption H, there exists a constant 
c and an algorithm running in time poly (log n) that on input n (written in binary 
notation) returns a list t\, . . . ,t\ og c n , and most elements of the list are concise repre- 
sentations of 2 log n-Ramsey graphs. 

Theorem 11.11 is folklore. It appears implicitely in the paper of M. Naor |Nao92j . 
Theorem 11.21 may also be known, but we are not aware of any published statement 
of it. Fortnow in the Computational Complexity blog |For06j and Santhanam at 
the 2011 Bertinoro seminar on Ramsey theory |Sanllj mention a weaker version of 
Theorem 11.21 in which the same hardness assumption is used but the size of the list 
is polynomial instead of polylogarithmic. This motivated us to write this note. 

Section H] contains some additional remarks. First we analyze the implication 
of Theorem 11.21 when plugged in a construction of M. Naor [Nao92j that builds a 
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fc-Ramsey graph from a list of graphs, most of which are fc'-Ramsey graphs, which 
is exactly what Theorem 11.21 delivers. We notice that the parameters obtained in 
this way are inferior to the result of Barak et al. [BRSW06]. Secondly, we consider 
the problem of explicit lower bounds for the van der Waerden Theorem, a problem 
which is related to the explicit construction of Ramsey graphs. We notice that the 
hardness assumption which derandomizes BPP implies lower bounds for the van der 
Waerden Theorem that match the non-constructive lower bounds obtained via the 
Lovasz Local Lemma. The original proof of the Lovasz Local Lemma does not seem 
to yield this result. Instead we use a proof of Gasarch and Haeupler |GHllj . based 
on the methods of Moser [Mos09j and Moser and Tardos [MT10] . 

2 The hardness assumption 

The hardness assumption needed in theorem 11.21 is that there exists a function / 
computable in E (where E = (J DTIME[2 cn ]) that, for some e > 0, cannot be 
computed by circuits of size 2 en that also have SAT gates (in addition to the standard 
logical gates). More formally let us denote by Cj AT (n) the size of the smallest circuit 
with SAT gates that computes the function / for inputs of length n. 

Assumption H: There exists a function / in E such that, for some e > 0, for every 
n, C s f AT (n) > T n . 

Klivans and van Melkebeek |KvM02] , generalizing the work of Nisan and Wigder- 
son |NW94] and Impagliazzo and Wigderson [IW97] , have shown that, under as- 
sumption H, for every k, there is a constant c and a pseudo-random generator 
g : {0, i} cl °s n — y {o, l} ra , computable in time polynomial in n, that fools all n h - 
size circuits with SAT gates. Formally, for every circuit C with SAT gates of size 

k 

n , 

|Prob se{0il} do g „[C(g(s)) = 1] - Prob ze{0 ,i } n[C(z) = 1]| < l/n k . 

We note that assumption H is realistic. Miltersen |Mil01] has shown that it is 
implied by the following natural assumption, involving uniform complexity classes: 
for every e > 0, there is a function / 6 E that cannot be computed in space 2 en for 
infinitely many lengths n. 

3 Proofs 

Proof of Theorem 11.11 

Let us first review the probabilistic argument showing the existence of 2 log n- 
Ramsey graphs. A graph G with n vertices can be represented by a string of length 
(™) • If we take at random such a graph and fix a subset of k vertices, the probability 
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that the set forms a clique or an independent set is 2 w . The probability that this 
holds for some fc-subset is bounded by 

g) ■ 2-(*) +1 < (f ) k ■ 2"© +1 
= 2^t-(2)+i 

For k = 21ogn, the above expression goes to 0. Thus, for n large enough, the 
probability that a graph G is 21ogn-Ramsey is > 0.99. 

The key observation is that this argument remains valid if we take a distribution 
that is 21og 2 n-wise independent. Thus, we can take a polynomial p(X) of degree 
21og 2 n over the field GF[2 9 ], where q = log (™). To the polynomial p we associate 
the string p = p{a\)\ . . .p(a/«\)i, where ai, . . . , are the elements of the field and 

(p(a))i is the first bit of p(a). When p is random, this yields a distribution over 
strings of length Q) that is 2 log 2 n- wise independent. Observe that a polynomial p 
is given by a string of length n = (2 log 2 n + 1) log Q) = 0(\og 3 n). It follows that 

Prob pg { 0j i}n[p is 2 log n- Ramsey] > 0.99. 

In quasipolynomial time we can enumerate the graphs p, and 99% of them are 2 log n- 
Ramsey. I 

Note. By using an almost fc-wise independent distribution (see [NN93} 
IAGHR92] ) . one can reduce the size of the list to 2°( lo s n \ 

Proof of Theorem 11.21 and of Theorem 11.31 

Let p, p, n be as in the proof of Theorem 11.11 Thus: 

• p G {0, l} n represents a polynomial, 

• p is built from the values taken by p at all the elements of the underlying field, 
and represents a graph with n vertices, 

• n = 0(log 3 n). 

Let us call a string p good if p is a 2 log n- Ramsey graph. 

Checking that a string p is not good is an NP predicate. Indeed, p is not good iff 
3(z'i, . . . , 22 log n) £ [^] 21ogn [ vertices ii, . . . , Z2iogn i n P form a clique or an independent 
set]. The 3 is over a string of length polynomial in \p\ and the property in the right 
parentheses can be checked by computing 0(log 2 n) values of the polynomial p, which 
can be done in time polynomial in \p\. 

Assumption H implies that there exists a pseudo-random generator g : 
{0, i} cl °s n — y {o, l} n , computable in time polynomial in n, that fools all NP 
predicates, and, in particular, also the one above. Since 99% of the p are good, it 
follows that for 90% of the seeds s G {0, l} cl °g w , g(s) is good, i.e., for 90% of s, g(s) 
is 2 log n- Ramsey. Note that from a seed s we can compute g(s) and next g(s) in 
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time polynomial in n. If we do this for every seed s G {0, i} cl °s n ; we obtain a list 
with n c = 0(log 3c n) graphs of which at least 90% are 2 log n- Ramsey graphs. 

Theorem ll.3l is obtained by observing that {g(s) \ s G {0, l} cl °s n } i s a list that can 
be computed in poly (log n) time, and most of its elements are concise representations 
of 2 log n- Ramsey graphs. I 



4 Additional remarks 

4.1 Constructing a single Ramsey graph from a list of graphs 
of which the majority are Ramsey graphs 

M. Naor |Nao92] has shown how to construct a Ramsey graph from a list of m graphs 
such that all the graphs in the list, except at most am of them, are A;- Ramsey. We 
analyze what parameters are obtained, if we apply Naor's construction to the list of 
graphs in Theorem 11.21 

The main idea of Naor's construction is to use the product of two graphs G\ = 
(Vi,Ei) and G 2 = (V 2 ,E 2 ), which is the graph whose set of vertices is V\ x V 2 and 
edges defined as follows: there is an edge between (u\,u 2 ) and (t>i,t>2) if and only if 
(ui,Vi) G Ei or («! = Vi) and (u 2 ,v 2 ) G E 2 . Then, one can observe that if Gi is k\- 
Ramsey and G 2 is A^-Ramsey, the product graph, G\ x G 2 is fciA^-Ramsey. Extending 
to the product of multiple graphs G±,G 2 , . . . ,G m where each Gi is fcj-Ramsey, we 
obtain that the product graph is k\k 2 . . . fc m -Ramsey. 

If we apply this construction to a list of m graphs G\, G 2 , ■ ■ ■ , G m , each having 
n vertices and such that Probi[Gi is not k- Ramsey] < a, we obtain that the product 
of G\,G 2 , ■ ■ ■ ,G m is a graph G with iV = n m vertices that is t-Ramsey for t = 
n am^(\-oL)m, _ p Qr a <- x/logn, we have t < (2k) m . The list produced in Theorem 11.21 
has m = log c n, k = 2 log n, and one can show that a < 1/ log n. The product graph 
G has iV = 2 lo § Cnl °s n vertices and is t-Ramsey for t < 2 lo s c "' lo s lo s™+°( 1 ) < 2( 1 °g A 1 "' 3 , 
for some positive constant /3. 

Thus, under assumption H, there is a positive constant /3 and a polynomial time 
algorithm that on input 1^ constructs a graph with iV vertices that is 2^ logN ^ P - 
Ramsey. Note that this is inferior to the parameters achieved by the unconditional 
construction of Barak, Rao, Shaltiel and Wigderson [B RSW06] . 

4.2 Constructive lower bounds for the van der Waerden the- 
orem 

Van der Waerden Theorem is another classical result in Ramsey theory. It states that 
for every c and k there exists a number n such that for any coloring of {1, . . . ,n} 
with c colors, there exists k elements in arithmetic progression (fc-AP) that have the 
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same color. Let W(c, k) be the smallest such n. One question is to find a constructive 
lower bound for W(c, k). To simplify the discussion, let us focus on W(2,k). 

In other words, the problem that we want to solve is the following: 

For any k, we want to find a value of n — n(k) as large as possible and a 2- 
coloring of {1, . . . ,n} such that no k-AP is monochromatic. Furthermore, we want 
the 2-coloring to be computable in time polynomial in n. 

Gasarch and Haeupler |GH11] have studied this problem. They present a proba- 
bilistic polynomial time construction for n = ^—r 1 (i.e., the 2-coloring is obtained 

by a probabilistic algorithm running in 2°^ k ' time) and a (deterministic) polynomial 
time construction for n = 2(k ^ £) (i.e., the 2-coloring is obtained in determinis- 
tic 2°( fc//<E ) time). Their constructions are based on the constructive version of the 
Lovasz Local Lemma due to Moser |Mos09j and Moser and Tardos |MT10j . The 
probabilistic algorithm of Gasarch and Haeupler is "BPP-like", in the sense that it 
succeeds with probability 2/3 and the correctness of the 2-coloring produced by it 
can be checked in polynomial time. It follows that it can be derandomized under 
the hardness assumption that derandomizes BPP, using the Impagliazzo-Wigderson 
pseudo-random generator [IW97J. It is interesting to remark that the new proof by 
Moser and Tardos of the Local Lovasz Lemma is essential here, because the success 
probability guaranteed by the classical proof is too small to be used in combination 
with the Impagliazzo-Wigderson pseudo-random generator. 

We proceed with the details. 

We use the following hardness assumption H' (weaker than assumption H), which 
is the one used to derandomize BPP [IW97J. 

Assumption H'\ There exists a function / in E such that, for some e > 0, for 
every n, Cf(n) > 2 en . 

Impagliazzo and Wigderson [IW97] have shown that, under assumption H, for 
every k, there is a constant c and a pseudo-random generator g : {0, i| cl °s n — > {o, 1}" 
that fools all n fc -size circuits and that is computable in time polynomial in n. 

Proposition 4.1 Assume assumption H' . For every k, let n = n{k) = ^-r- — 1. 
There exists a polynomial-time algorithm that on input l n 2-colors the set {1, . . . ,n} 
such that no k-AP is monochromatic. 

Proof. The algorithm of Gasarch and Haeupler [GHllj . on input 1™, uses a random 
string z of size \z\ = n c , for some constant c, and, with probability at least 2/3, 
succeeds to 2-color the set {1, . . . ,n} such that no k-AP is monochromatic. Let us 
call a string z to be good for n if the Gasarch-Haeupler algorithm on input l n and 
randomness z, produces a 2-coloring with no monochromatic k-APs. Note that there 
exists a polynomial-time algorithm A that checks if a string z is good or not, because 
the Gasarch-Haeupler algorithm runs in polynomial time and the number of fc-APs 
inside {1, . . . , n} is bounded by n 2 /k. Using assumption H' and invoking the result 
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of Impagliazzo and Wigderson [IW97], we derive that there exists a constant d and a 
pseudo-random generator g : {0, l} dlo § n — y {(], l} nC such that 

Prob se{0il} dio g „[A(g(s)) = good for n] > 2/3 - 1/10 > 0. 

Therefore if we try all possible seeds s of length d log n, we will find one s such that 
g(s) induces the Gasarch-Haeupler algorithm to 2-color the set {1, . . . , n} such that 
no k-AP is monochromatic. I 
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